KLEAT: Cleavage Site Analysis of Transcriptomes
نویسندگان
چکیده
In eukaryotic cells, alternative cleavage of 3' untranslated regions (UTRs) can affect transcript stability, transport and translation. For polyadenylated (poly(A)) transcripts, cleavage sites can be characterized with short-read sequencing using specialized library construction methods. However, for large-scale cohort studies as well as for clinical sequencing applications, it is desirable to characterize such events using RNA-seq data, as the latter are already widely applied to identify other relevant information, such as mutations, alternative splicing and chimeric transcripts. Here we describe KLEAT, an analysis tool that uses de novo assembly of RNA-seq data to characterize cleavage sites on 3' UTRs. We demonstrate the performance of KLEAT on three cell line RNA-seq libraries constructed and sequenced by the ENCODE project, and assembled using Trans-ABySS. Validating the KLEAT predictions with matched ENCODE RNA-seq and RNA-PET libraries, we show that the tool has over 90% positive predictive value when there are at least three RNA-seq reads supporting a poly(A) tail and requiring at least three RNA-PET reads mapping within 100 nucleotides as validation. We also compare the performance of KLEAT with other popular RNA-seq analysis pipelines that reconstruct 3' UTR ends, and show that it performs favourably, based on an ROC-like curve.
منابع مشابه
Pathogenicity and haemagglutinin gene sequence analysis of Iranian avian influenza H9N2 viruses isolated during (1998–2001)
Sixteen avian influenza (AI) H9N2 viruses were isolated from disease outbreaks in different parts of Iranduring (1998–2001). These AI isolates were used for pathogenicity, haemagglutinin (HA) gene variation andphylogenetic analysis. Results in both pathogenicity tests and HA gene cleavage site sequence detectionrepresented a non-highly pathogenic feature for all Iranian AI isolates studied. The...
متن کاملTranscriptional activity regulates alternative cleavage and polyadenylation
Genes containing multiple pre-mRNA cleavage and polyadenylation sites, or polyA sites, express mRNA isoforms with variable 3' untranslated regions (UTRs). By systematic analysis of human and mouse transcriptomes, we found that short 3'UTR isoforms are relatively more abundant when genes are highly expressed whereas long 3'UTR isoforms are relatively more abundant when genes are lowly expressed....
متن کاملGenetic Analysis of Avian Orthoavulavirus Type I (AOAV-1) Strains Isolated from Broiler Flocks
Background and aim: Newcastle disease (ND) is one of the most serious diseases among many species of birds and causes devastating effects on the poultry industry. This disease is endemic in Iran and ND outbreaks occur unexpectedly and with high mortality and severe clinical signs. The sequence of the F protein cleavage site is that the major virulence determinant of Newcastle disease virus (NDV...
متن کاملmicroRNA-directed cleavage and translational repression of the copper chaperone for superoxide dismutase mRNA in Arabidopsis.
microRNA398 (miR398) is a conserved miRNA of plants that targets two of the three copper/zinc superoxide dismutases (SOD) of Arabidopsis (CSD1 and CSD2) by triggering cleavage or inhibiting translation of their mRNAs. We analysed the transcriptomes of mutants impaired in miR398 production, and found that the mRNAs encoding the copper chaperone for superoxide dismutase (CCS1), which delivers cop...
متن کاملAmino Acid Sequence Analysis of Hemagglutinin Protein of H9N2 Isolated from Broilers in Tehran in 2007
Background and Aims: Since 1998, Iranian poultry industry has been affected by avian influenza (AI) virus, subtype H9N2. The association of high mortality and case report of H5N1 and H9N2 influenza virus in wild birds in recent years raised the suspicion of a possible new genetic modified AI virus. Methods: Partial nucleotide sequences and deduced amino acid of hemagglutinin (HA) genes of 4 H9...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing
دوره شماره
صفحات -
تاریخ انتشار 2015